Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 50
Filtrar
1.
J Biomed Inform ; 145: 104464, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37541406

RESUMO

OBJECTIVE: We explore the framing of literature-based discovery (LBD) as link prediction and graph embedding learning, with Alzheimer's Disease (AD) as our focus disease context. The key link prediction setting of prediction window length is specifically examined in the context of a time-sliced evaluation methodology. METHODS: We propose a four-stage approach to explore literature-based discovery for Alzheimer's Disease, creating and analyzing a knowledge graph tailored to the AD context, and predicting and evaluating new knowledge based on time-sliced link prediction. The first stage is to collect an AD-specific corpus. The second stage involves constructing an AD knowledge graph with identified AD-specific concepts and relations from the corpus. In the third stage, 20 pairs of training and testing datasets are constructed with the time-slicing methodology. Finally, we infer new knowledge with graph embedding-based link prediction methods. We compare different link prediction methods in this context. The impact of limiting prediction evaluation of LBD models in the context of short-term and longer-term knowledge evolution for Alzheimer's Disease is assessed. RESULTS: We constructed an AD corpus of over 16 k papers published in 1977-2021, and automatically annotated it with concepts and relations covering 11 AD-specific semantic entity types. The knowledge graph of Alzheimer's Disease derived from this resource consisted of ∼11 k nodes and ∼394 k edges, among which 34% were genotype-phenotype relationships, 57% were genotype-genotype relationships, and 9% were phenotype-phenotype relationships. A Structural Deep Network Embedding (SDNE) model consistently showed the best performance in terms of returning the most confident set of link predictions as time progresses over 20 years. A huge improvement in model performance was observed when changing the link prediction evaluation setting to consider a more distant future, reflecting the time required for knowledge accumulation. CONCLUSION: Neural network graph-embedding link prediction methods show promise for the literature-based discovery context, although the prediction setting is extremely challenging, with graph densities of less than 1%. Varying prediction window length on the time-sliced evaluation methodology leads to hugely different results and interpretations of LBD studies. Our approach can be generalized to enable knowledge discovery for other diseases. AVAILABILITY: Code, AD ontology, and data are available at https://github.com/READ-BioMed/readbiomed-lbd.


Assuntos
Doença de Alzheimer , Descoberta do Conhecimento , Humanos , Descoberta do Conhecimento/métodos , Doença de Alzheimer/diagnóstico , Redes Neurais de Computação , Aprendizagem , Fenótipo
2.
J Biomed Inform ; 143: 104362, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37146741

RESUMO

Scientific literature presents a wealth of information yet to be explored. As the number of researchers increase with each passing year and publications are released, this contributes to an era where specialized fields of research are becoming more prevalent. As this trend continues, this further propagates the separation of interdisciplinary publications and makes keeping up to date with literature a laborious task. Literature-based discovery (LBD) aims to mitigate these concerns by promoting information sharing among non-interacting literature while extracting potentially meaningful information. Furthermore, recent advances in neural network architectures and data representation techniques have fueled their respective research communities in achieving state-of-the-art performance in many downstream tasks. However, studies of neural network-based methods for LBD remain to be explored. We introduce and explore a deep learning neural network-based approach for LBD. Additionally, we investigate various approaches to represent terms as concepts and analyze the affect of feature scaling representations into our model. We compare the evaluation performance of our method on five hallmarks of cancer datasets utilized for closed discovery. Our results show the chosen representation as input into our model affects evaluation performance. We found feature scaling our input representations increases evaluation performance and decreases the necessary number of epochs needed to achieve model generalization. We also explore two approaches to represent model output. We found reducing the model's output to capturing a subset of concepts improved evaluation performance at the cost of model generalizability. We also compare the efficacy of our method on the five hallmarks of cancer datasets to a set of randomly chosen relations between concepts. We found these experiments confirm our method's suitability for LBD.


Assuntos
Aprendizado Profundo , Neoplasias , Humanos , Redes Neurais de Computação , Descoberta do Conhecimento/métodos , Publicações
3.
J Biomed Inform ; 142: 104383, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37196989

RESUMO

OBJECTIVE: To demonstrate and develop an approach enabling individual researchers or small teams to create their own ad-hoc, lightweight knowledge bases tailored for specialized scientific interests, using text-mining over scientific literature, and demonstrate the effectiveness of these knowledge bases in hypothesis generation and literature-based discovery (LBD). METHODS: We propose a lightweight process using an extractive search framework to create ad-hoc knowledge bases, which require minimal training and no background in bio-curation or computer science. These knowledge bases are particularly effective for LBD and hypothesis generation using Swanson's ABC method. The personalized nature of the knowledge bases allows for a somewhat higher level of noise than "public facing" ones, as researchers are expected to have prior domain experience to separate signal from noise. Fact verification is shifted from exhaustive verification of the knowledge base to post-hoc verification of specific entries of interest, allowing researchers to assess the correctness of relevant knowledge base entries by considering the paragraphs in which the facts were introduced. RESULTS: We demonstrate the methodology by constructing several knowledge bases of different kinds: three knowledge bases that support lab-internal hypothesis generation: Drug Delivery to Ovarian Tumors (DDOT); Tissue Engineering and Regeneration; Challenges in Cancer Research; and an additional comprehensive, accurate knowledge base designated as a public resource for the wider community on the topic of Cell Specific Drug Delivery (CSDD). In each case, we show the design and construction process, along with relevant visualizations for data exploration, and hypothesis generation. For CSDD and DDOT we also show meta-analysis, human evaluation, and in vitro experimental evaluation. CONCLUSION: Our approach enables researchers to create personalized, lightweight knowledge bases for specialized scientific interests, effectively facilitating hypothesis generation and literature-based discovery (LBD). By shifting fact verification efforts to post-hoc verification of specific entries, researchers can focus on exploring and generating hypotheses based on their expertise. The constructed knowledge bases demonstrate the versatility and adaptability of our approach to versatile research interests. The web-based platform, available at https://spike-kbc.apps.allenai.org, provides researchers with a valuable tool for rapid construction of knowledge bases tailored to their needs.


Assuntos
Mineração de Dados , Descoberta do Conhecimento , Humanos , Mineração de Dados/métodos , Descoberta do Conhecimento/métodos , Publicações
5.
J Community Psychol ; 49(6): 1718-1731, 2021 08.
Artigo em Inglês | MEDLINE | ID: mdl-34004017

RESUMO

Large amounts of text-based data, like study abstracts, often go unanalyzed because the task is laborious. Natural language processing (NLP) uses computer-based algorithms not traditionally implemented in community psychology to effectively and efficiently process text. These methods include examining the frequency of words and phrases, the clustering of topics, and the interrelationships of words. This article applied NLP to explore the concept of equity in community psychology. The COVID-19 crisis has made pre-existing health equity gaps even more salient. Community psychology has a specific interest in working with organizations, systems, and communities to address social determinants that perpetuate inequities by refocusing interventions around achieving health and wellness for all. This article examines how community psychology has discussed equity thus far to identify strengths and gaps for future research and practice. The results showed the prominence of community-based participatory research and the diversity of settings researchers work in. However, the total number of abstracts with equity concepts was lower than expected, which suggests there is a need for a continued focus on equity.


Assuntos
Psiquiatria Comunitária/métodos , Pesquisa Participativa Baseada na Comunidade/métodos , Equidade em Saúde/estatística & dados numéricos , Descoberta do Conhecimento/métodos , Processamento de Linguagem Natural , Determinantes Sociais da Saúde/estatística & dados numéricos , Humanos , Publicações Periódicas como Assunto
6.
Br J Soc Psychol ; 60(1): 1-28, 2021 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-33616965

RESUMO

The COVID-19 pandemic points to the need for scientists to pool their efforts in order to understand this disease and respond to the ensuing crisis. Other global challenges also require such scientific cooperation. Yet in academic institutions, reward structures and incentives are based on systems that primarily fuel the competition between (groups of) scientific researchers. Competition between individual researchers, research groups, research approaches, and scientific disciplines is seen as an important selection mechanism and driver of academic excellence. These expected benefits of competition have come to define the organizational culture in academia. There are clear indications that the overreliance on competitive models undermines cooperative exchanges that might lead to higher quality insights. This damages the well-being and productivity of individual researchers and impedes efforts towards collaborative knowledge generation. Insights from social and organizational psychology on the side effects of relying on performance targets, prioritizing the achievement of success over the avoidance of failure, and emphasizing self-interest and efficiency, clarify implicit mechanisms that may spoil valid attempts at transformation. The analysis presented here elucidates that a broader change in the academic culture is needed to truly benefit from current attempts to create more open and collaborative practices for cumulative knowledge generation.


Assuntos
Comunicação Interdisciplinar , Colaboração Intersetorial , Descoberta do Conhecimento , Ciência/educação , Currículo , Eficiência , Humanos , Descoberta do Conhecimento/métodos , Pesquisa/educação
7.
PLoS One ; 16(2): e0244618, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33571223

RESUMO

Just like everything in nature, scientific topics flourish and perish. While existing literature well captures article's life-cycle via citation patterns, little is known about how scientific popularity and impact evolves for a specific topic. It would be most intuitive if we could 'feel' topic's activity just as we perceive the weather by temperature. Here, we conceive knowledge temperature to quantify topic overall popularity and impact through citation network dynamics. Knowledge temperature includes 2 parts. One part depicts lasting impact by assessing knowledge accumulation with an analogy between topic evolution and isobaric expansion. The other part gauges temporal changes in knowledge structure, an embodiment of short-term popularity, through the rate of entropy change with internal energy, 2 thermodynamic variables approximated via node degree and edge number. Our analysis of representative topics with size ranging from 1000 to over 30000 articles reveals that the key to flourishing is topics' ability in accumulating useful information for future knowledge generation. Topics particularly experience temperature surges when their knowledge structure is altered by influential articles. The spike is especially obvious when there appears a single non-trivial novel research focus or merging in topic structure. Overall, knowledge temperature manifests topics' distinct evolutionary cycles.


Assuntos
Descoberta do Conhecimento/métodos , Publicações/tendências , Pesquisa/tendências , Conhecimento , Modelos Teóricos , Cultura Popular , Projetos de Pesquisa/tendências
13.
BMJ Health Care Inform ; 28(1)2021 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-33419870

RESUMO

INTRODUCTION: Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult. METHODS: A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publications using an engine leveraging medical knowledge bases to identify discrete medical concepts and an open-source tool (Gephi) to visualise the network. RESULTS: The network shows connections between diseases, medications and procedures identified from the title and abstract of 195 958 COVID-19-related publications (CORD-19 Dataset). Connections between terms with few publications, those unconnected to the main network and those irrelevant were not displayed. Nodes were coloured by knowledge base and the size of the node related to the number of publications containing the term. The data set and visualisations were made publicly accessible via a webtool. CONCLUSION: Knowledge management approaches (text mining and graph networks) can effectively allow rapid navigation and exploration of entity inter-relationships to improve understanding of diseases such as COVID-19.


Assuntos
Inteligência Artificial , COVID-19/epidemiologia , Descoberta do Conhecimento/métodos , Publicações Periódicas como Assunto/estatística & dados numéricos , Humanos , Processamento de Linguagem Natural , SARS-CoV-2
15.
Nature ; 587(7833): 240-245, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-33177664

RESUMO

The Zoonomia Project is investigating the genomics of shared and specialized traits in eutherian mammals. Here we provide genome assemblies for 131 species, of which all but 9 are previously uncharacterized, and describe a whole-genome alignment of 240 species of considerable phylogenetic diversity, comprising representatives from more than 80% of mammalian families. We find that regions of reduced genetic diversity are more abundant in species at a high risk of extinction, discern signals of evolutionary selection at high resolution and provide insights from individual reference genomes. By prioritizing phylogenetic diversity and making data available quickly and without restriction, the Zoonomia Project aims to support biological discovery, medical research and the conservation of biodiversity.


Assuntos
Conservação dos Recursos Naturais , Eutérios/classificação , Eutérios/genética , Variação Genética , Genômica/métodos , Descoberta do Conhecimento , Animais , Biodiversidade , Pesquisa Biomédica , Conservação dos Recursos Naturais/métodos , Evolução Molecular , Extinção Biológica , Especiação Genética , Humanos , Infecções , Descoberta do Conhecimento/métodos , Perda de Heterozigosidade , Neoplasias , Filogenia , Medição de Risco , Seleção Genética , Alinhamento de Sequência , Especificidade da Espécie , Peçonhas
16.
Nurs Philos ; 21(3): e12309, 2020 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-32537914

RESUMO

To revitalize nursing science, there is a need for a new approach to guide nurse scientists in addressing complex problems in health care. By applying theoretical concepts from a revolutionary philosopher of science, Paul K. Feyerabend, new nursing knowledge can be produced using creativity and pluralistic approaches. Feyerabend proposed that methods within and outside of science can produce knowledge. Despite the recognition of Feyerabendian philosophy within science, there is currently a lack of literature regarding the relevance of Feyerabendian philosophy for nursing science. We aim to (a) describe and critique Feyerabendian concepts, (b) discuss the potential application of Feyerabendian philosophy for knowledge production within gerontological nursing and (c) describe theoretical possibilities for nurse scientists in using Feyerabendian philosophy to guide nursing knowledge development. We begin by introducing Feyerabend's life and his inspirations for his theoretical concepts, epistemological anarchism, theoretical pluralism and humanitarianism, and conclude by offering suggestions of how to apply Feyerabendian philosophy in nursing research.


Assuntos
Descoberta do Conhecimento/métodos , Enfermagem/métodos , Filosofia , Humanos , Enfermagem/tendências
17.
PLoS One ; 15(6): e0233879, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32544200

RESUMO

Although a great deal of attention has been paid to how conspiracy theories circulate on social media, and the deleterious effect that they, and their factual counterpart conspiracies, have on political institutions, there has been little computational work done on describing their narrative structures. Predicating our work on narrative theory, we present an automated pipeline for the discovery and description of the generative narrative frameworks of conspiracy theories that circulate on social media, and actual conspiracies reported in the news media. We base this work on two separate comprehensive repositories of blog posts and news articles describing the well-known conspiracy theory Pizzagate from 2016, and the New Jersey political conspiracy Bridgegate from 2013. Inspired by the qualitative narrative theory of Greimas, we formulate a graphical generative machine learning model where nodes represent actors/actants, and multi-edges and self-loops among nodes capture context-specific relationships. Posts and news items are viewed as samples of subgraphs of the hidden narrative framework network. The problem of reconstructing the underlying narrative structure is then posed as a latent model estimation problem. To derive the narrative frameworks in our target corpora, we automatically extract and aggregate the actants (people, places, objects) and their relationships from the posts and articles. We capture context specific actants and interactant relationships by developing a system of supernodes and subnodes. We use these to construct an actant-relationship network, which constitutes the underlying generative narrative framework for each of the corpora. We show how the Pizzagate framework relies on the conspiracy theorists' interpretation of "hidden knowledge" to link otherwise unlinked domains of human interaction, and hypothesize that this multi-domain focus is an important feature of conspiracy theories. We contrast this to the single domain focus of an actual conspiracy. While Pizzagate relies on the alignment of multiple domains, Bridgegate remains firmly rooted in the single domain of New Jersey politics. We hypothesize that the narrative framework of a conspiracy theory might stabilize quickly in contrast to the narrative framework of an actual conspiracy, which might develop more slowly as revelations come to light. By highlighting the structural differences between the two narrative frameworks, our approach could be used by private and public analysts to help distinguish between conspiracy theories and conspiracies.


Assuntos
Enganação , Descoberta do Conhecimento/métodos , Narração , Política , Mídias Sociais , Software , Análise de Dados , Humanos , Aprendizado de Máquina , Estados Unidos
18.
PLoS One ; 15(5): e0232891, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32413059

RESUMO

Literature-based Discovery (LBD) aims to discover new knowledge automatically from large collections of literature. Scientific literature is growing at an exponential rate, making it difficult for researchers to stay current in their discipline and easy to miss knowledge necessary to advance their research. LBD can facilitate hypothesis testing and generation and thus accelerate scientific progress. Neural networks have demonstrated improved performance on LBD-related tasks but are yet to be applied to it. We propose four graph-based, neural network methods to perform open and closed LBD. We compared our methods with those used by the state-of-the-art LION LBD system on the same evaluations to replicate recently published findings in cancer biology. We also applied them to a time-sliced dataset of human-curated peer-reviewed biological interactions. These evaluations and the metrics they employ represent performance on real-world knowledge advances and are thus robust indicators of approach efficacy. In the first experiments, our best methods performed 2-4 times better than the baselines in closed discovery and 2-3 times better in open discovery. In the second, our best methods performed almost 2 times better than the baselines in open discovery. These results are strong indications that neural LBD is potentially a very effective approach for generating new scientific discoveries from existing literature. The code for our models and other information can be found at: https://github.com/cambridgeltl/nn_for_LBD.


Assuntos
Descoberta do Conhecimento/métodos , Redes Neurais de Computação , Mineração de Dados/métodos , Humanos , Neoplasias/metabolismo , Reconhecimento Automatizado de Padrão/métodos , Revisão por Pares , Comunicação Acadêmica
19.
Adv Exp Med Biol ; 1194: 181-191, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32468534

RESUMO

The exponential growth of the number and variety of IoT devices and applications for personal use, as well as the improvement of their quality and performance, facilitates the realization of intelligent eHealth concepts. Nowadays, it is easier than ever for individuals to monitor themselves, quantify, and log their everyday activities in order to gain insights about their body's performance and receive recommendations and incentives to improve it. Of course, in order for such systems to live up to the promise, given the treasure trove of data that is collected, machine learning techniques need to be integrated in the processing and analysis of the data. This systematic and automated quantification, logging, and analysis of personal data, using IoT and AI technologies, have given birth to the phenomenon of Quantified-Self. This work proposes a prototype decentralized Quantified-Self application, built on top of a dedicated IoT gateway that aggregates and analyzes data from multiple sources, such as biosignal sensors and wearables, and performs analytics on it.


Assuntos
Descoberta do Conhecimento , Monitorização Fisiológica , Monitores de Aptidão Física/normas , Monitores de Aptidão Física/tendências , Humanos , Descoberta do Conhecimento/métodos , Aprendizado de Máquina , Monitorização Fisiológica/instrumentação , Monitorização Fisiológica/métodos , Telemedicina
20.
Int J Health Care Qual Assur ; 33(2): 221-234, 2020 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-32233355

RESUMO

PURPOSE: Incident reporting systems are commonly deployed in healthcare but resulting datasets are largely warehoused. This study explores if intelligence from such datasets could be used to improve quality, efficiency, and safety. DESIGN/METHODOLOGY/APPROACH: Incident reporting data recorded in one NHS acute Trust was mined for insight (n = 133,893 April 2005-July 2016 across 201 fields, 26,912,493 items). An a priori dataset was overlaid consisting of staffing, vital signs, and national safety indicators such as falls. Analysis was primarily nonlinear statistical approaches using Mathematica V11. FINDINGS: The organization developed a deeper understanding of the use of incident reporting systems both in terms of usability and possible reflection of culture. Signals emerged which focused areas of improvement or risk. An example of this is a deeper understanding of the timing and staffing levels associated with falls. Insight into the nature and grading of reporting was also gained. PRACTICAL IMPLICATIONS: Healthcare incident reporting data is underused and with a small amount of analysis can provide real insight and application to patient safety. ORIGINALITY/VALUE: This study shows that insight can be gained by mining incident reporting datasets, particularly when integrated with other routinely collected data.


Assuntos
Mineração de Dados/métodos , Descoberta do Conhecimento/métodos , Gestão de Riscos/métodos , Humanos , Segurança do Paciente , Qualidade da Assistência à Saúde/organização & administração , Gestão da Segurança , Medicina Estatal , Reino Unido
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...